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TO ALL WHOM IT MAY CONCERN: 

BE IT KNOWN that We, Peter D. Costigan Jr. of Los Gatos, California and citizen of the 
United States, Laurence E. England of Morgan Hill, California and citizen of the United States, 
and James R. Rhyne II of Los Gatos, California and citizen of the United States have invented 
new and useful improvements in 



METHOD, SYSTEM, COMPUTER PROGRAM PRODUCT, AND ARTICLE OF 
MANUFACTURE FOR CONSTRUCTION OF A COMPUTER APPLICATION 
INTERFACE FOR CONSUMPTION BY A CONNECTOR BUILDER 

of which the following is a specification: 
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METHOD, SYSTEM, COMPUTER PROGRAM PRODUCT, AND ARTICLE OF 
MANUFACTURE FOR CONSTRUCTION OF A COMPUTER APPLICATION 
INTERFACE FOR CONSUMPTION BY A CONNECTOR BUILDER 



A portion of the Disclosure of this patent document contains material which is subject 
to copyright protection. The copyright owner has no objection to the facsimile reproduction by 
anyone of the patent document or the patent disclosure, as it appears in the Patent and 
Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever. 
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BACKGROUND OF THE INVENTION 



1. Field of the Invention 

The present invention relates in general to software re-engineering, and more 
particularly to identifying an interface of an application program. 

2. Description of the Related Art 

With the advent of the World Wide Web (Web), many enterprises prefer to leverage 
existing mainframe application programs by connecting a Web front-end to those existing 
mainframe application programs. These mainframe application programs may include 
transaction systems such as the IBM® Customer Information Control System (CICS®), the 
IBM® Information Management System (IMS), or the IBM® DB2® relational database 
system. (IBM®, CICS®, and DB2® are registered trademarks of International Business 
Machines Corporation in the United States, other countries, or both.) To bridge between the 
Web front-end and the existing mainframe application program, a block of code, commonly 
known as a connector, is generated. Tools that generate a connector are available from various 
manufacturers, such as the IBM® Enterprise Access Builder, IBM® WebSphere Application 
Development Studio, IBM® VisualAge® Interspace, IBM® Visual Age® for Java®, or 
Microsoft® COM Transaction Integrator (COMTI). (VisualAge® is a registered trademark of 
International Business Machines Corporation in the United States, other countries, or both. 
WebSphere is a trademark of International Business Machines Corporation in the United 
States, other countries, or both. Java and all Java-based trademarks are trademarks of Sun 
Microsystems, Inc. in the United States, other countries, or both. Microsoft is a trademark of 
Microsoft Corporation in the United States, other countries, or both.) To generate a connector 
from a specified source code file comprising a mainframe application program, a connector 
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tool parses the source code contained in a specified source code file to obtain the information 
necessary to generate the connector. This conventional approach may generate the appropriate 
connector if the specified source code file contains the complete information necessary to 
generate the connector. However, the specified source code file may not contain any of the 
necessary information or may only contain a portion of the necessary information. For 
example, it may be a COBOL source code file containing a COPY statement which points to 
another source code file, a COBOL COPYBOOK, which actually contains the necessary 
information. Typically, this complete information is contained in and dispersed among 
multiple source code files. Thus, there is a need for a connector tool which can generate a 
connector based upon information dispersed among multiple source code files. 

Conventional connector generators are also language dependent, such as a Java-to- 
COBOL connector generator, a Java-to-CICS connector generator, or a C++-to-COBOL 
connector generator. The Java-to-COBOL connector generator cannot be used to generate a 
C++-to-COBOL connector, and the C++-to-COBOL connector generator cannot be used to 
generate a Java-to-COBOL connector. Thus, generating a second connector for a second 
language involves a repetition of the identification, parsing, selection, and editing steps which 
were performed for the generation of the first connector. Even if the same connector is being 
re-generated, i.e., the second connector is the same language as the first connector, the steps are 
repeated. The reuse of prior parsing, identification, and editing is difficult as conventional 
systems fail to store this information in a form which is consummable by a connector building 
tool. Thus, there is a need for a connector tool which supports multiple target languages, and 
which reuses information and analysis from one generation to the next or from one language to 
another language. 

Conventional methods have failed to provide solutions to these problems. Thus, there 
is a clearly felt need for a method, system, article of manufacture, and computer program 
product for providing improved identification of an interface of an application program 
dispersed among multiple files for consumption by a connector building tool. 
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SUMMARY OF THE INVENTION 



The present invention comprises a method, system, computer program product, and 
article of manufacture for identifying an interface of an application program. The source code 
of the application program is parsed to identify meta information, and the meta information and 
a link pointing to an original location of the meta information within the application program 
are stored in a repository. A user may then be allowed to query the repository to determine 
which source files and which interfaces comprise the application program. Responsive to the 
query, a new source file is constructed which contains the interfaces comprising the application 
program. The new source file and a link pointing to a location of the new source file are also 
stored in the repository. A meta language document is constructed which contains a 
description of the application program interfaces to enable a connector building tool to build 
an interface to the application program. 

One aspect of a preferred embodiment of the present invention parses source code of an 
application program to identify meta information. 

Another aspect of a preferred embodiment of the present invention stores the meta 
information and a link pointing to an original location of the meta information within the 
application program in a repository. 

Another aspect of a preferred embodiment of the present invention allows a user to 
query the repository to determine which source files and which interfaces comprise the 
application program. 

Another aspect of a preferred embodiment of the present invention constructs a new 
source file which contains the interfaces comprising the application program. 

Another aspect of a preferred embodiment of the present invention stores in the 
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repository the new source file and a link pointing to a location of the new source file. 

Another aspect of a preferred embodiment of the present invention constructs a meta 
language document which contains a description of the application program interfaces to 
enable a a connector building tool to build an interface to the application program. 

A preferred embodiment of the present invention has the advantage of providing 
improved identification of an application program's interface in a language-independent 
manner. 

A preferred embodiment of the present invention has the further advantage of 
identifying an application program's interface dispersed among multiple files. 

A preferred embodiment of the present invention has the further advantage of 
providing a single source file containing the interfaces comprising the application program. 

A preferred embodiment of the present invention has the further advantage of providing 
an identification of the interfaces comprising the application program which relieves a 
connector building tool of the burden of parsing multiple different languages. 

A preferred embodiment of the present invention has the further advantage of providing 
improved usability in a tool for identifying application program interfaces. 

A preferred embodiment of the present invention has the further advantage of providing 
improved functionality in a tool for identifying application program interfaces. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



For a more complete understanding of the present invention and the advantages thereof, 
reference is now made to the Description of the Preferred Embodiment in conjunction with the 
attached Drawings, in which: 

Figure 1 is a block diagram of a distributed computer system used in performing the 
method of the present invention, forming part of the apparatus of the present invention, and 
which may use the article of manufacture comprising a computer-readable storage medium 
having a computer program embodied in said medium which may cause the computer system 
to practice the present invention; 

Figure 2 is a block diagram of a preferred embodiment of the present invention; 

Figure 3 and Figure 4 are flowcharts illustrating operations preferred in carrying out 
the preferred embodiment of the present invention; and 

Figure 5, Figure 6, and Figure 7 are graphical user interfaces preferred in carrying out 
a user interface 296 of the preferred embodiment of the present invention. 
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■ 1 

1 DESCRIPTION OF THE PREFERRED EMBODIMENT 

2 

3 An embodiment of the invention is now described with reference to the figures where 

4 like reference numbers indicate identical or functionally similar elements. Also in the figures, 

5 the left most digit of each reference number corresponds to the figure in which the reference 

6 number is first used. While specific configurations and arrangements are discussed, it should 

7 be understood that this is done for illustrative purposes only. A person skilled in the relevant art 

8 will recognize that other configurations and arrangements can be used without departing from 

9 the spirit and scope of the invention. It will be apparent to a person skilled in the relevant art 
10 that this invention can also be employed in a variety of other devices and applications. 

11 

12 Referring first to Figure 1, there is depicted a graphical representation of a data 

y| processing system 8, which may be utilized to implement the present invention. As may be • 

! **, i 

Iff seen, data processing system 8 may include a plurality of networks, such as Local Area 

\S> Networks (LAN) 10 and 32, each of which preferably includes a plurality of individual 

16 computers 12 and 30, respectively. Alternatively, networks 10 and 32 may be intranets or 

ff portions of the internet. Of course, those skilled in the art will appreciate that a plurality of 

HI Intelligent Work Stations (IWS) coupled to a host processor may be utilized for each such 

|£ network. Each said network may also consist of a plurality of processors coupled via a 

|5 communications medium, such as shared memory, shared storage, or an interconnection 

•ass? 

35 network. As is common in such data processing systems, each individual computer may be 

22 coupled to a storage device 14 and/or a printer/output device 16 and may be provided with a 

23 pointing device such as a mouse 17. 
24 

25 The data processing system 8 may also include multiple mainframe computers, such as 

26 mainframe computer 18, which may be preferably coupled to LAN 10 by means of 

27 communications link 22. The mainframe computer 18 may also be coupled to a storage device 

28 20 which may serve as remote storage for LAN 10. Similarly, LAN 10 may be coupled via 

29 communications link 24 through a sub-system control unit/communications controller 26 and 
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1 communications link 34 to a gateway server 28. The gateway server 28 may be an IWS which 

2 serves to link LAN 32 to LAN 10. Preferably, server 28 is a web application server which 

3 passes transactions from a requester 30 on the internet 32 to the mainframe 18 upon which a 

4 back-end application serving the transaction is executing. 
5 

6 With respect to LAN 32 and LAN 10, a plurality of documents or resource objects may 

7 be stored within storage device 20 and controlled by mainframe computer 18, as resource 

8 manager or library service for the resource objects thus stored. Of course, those skilled in the 

9 art will appreciate that mainframe computer 18 may be located a great geographic distance 

10 from LAN 10 and similarly, LAN 10 may be located a substantial distance from LAN 32. For 

1 1 example, LAN 32 may be located in California while LAN 10 may be located within North 
g Carolina and mainframe computer 18 may be located in New York. 

d 

M 

U Software program code which employs the present invention is typically stored in the 

|JJ memory of a storage device 14 of a stand alone workstation, LAN server, or host from which a 

W developer may access the code for distribution purposes, the software program code may be 

ty 

lj embodied on any of a variety of known media for use with a data processing system such as a 

||l diskette or CD-ROM or may be distributed to users from a memory of one computer system 

W over a network of some type to other computer systems for use by users of such other systems. 

2§ Such techniques and methods for embodying software code on media and/or distributing 

2f software code are well-known and will not be further discussed herein. 
22 

23 As will be appreciated upon reference to the foregoing, it is often desirable for a user 

24 to link an application program on the mainframe 18 to the internet 32 and/or World Wide Web, 

25 where the application program was not originally designed for Web or internet based 

26 transactions. A preferred embodiment of the present invention assists the user in performing 

27 such a Web-enablement adaptation of the application program by providing improved 

28 identification of an interface of an application program dispersed among multiple files for 

29 consumption by a connector building tool. 



STL920000081US1 



Referring now to Figure 2, a block diagram of a preferred embodiment of the present 
invention, known as a Connector Builder Assistant (CBA), is illustrated. The primary 
objective of the Connector Builder Assistant is to assist a user in the finding of useful 
application program assets for Web-enablement and the building of connectors from various 
environments (e.g., Java, C++) to these mainframe application program assets (such as 
COBOL, CICS, IMS, or DB2 transactions). Although the preferred embodiment is presented 
in the context of a COBOL and CICS example, the invention may be applied to provide 
connectors from other languages to other transaction targets, such as IMS. Source files 210, 
such as COBOL source files including copybooks 215 for an application 220, are parsed by a 
source code scanner 205 to identify meta information 225, and the meta information 225 is 
stored in a repository 230. The location 235 of the original source files 210, 215 and 240 for 
all of the meta information is also stored in the repository 230 providing linkage 245 back to 
the source files 210 for a given metadatum 225. This repository 230 may then be queried 250 
by a user to determine which source files 210, 215 and 240 comprise the application 220, and 
more specifically, which interfaces 255 comprise the application 220. Using editor-style 
commands 260, the user may construct a new source file 265 that comprises the interface 
definition required by a connector tool 270 to build a connector 280 to the application. This 
may involve selecting 285 and/or modifying 260 a portion out of one file, expanding another 
file (i.e., COPYBOOK) inline, or a combination of the two. This is a non-destructive operation 
in that the original files remain unchanged. An XMI (XML Metadata Interchange) document 
275 is then created that contains the information required by the connector tool 270 in a 
canonical format. The XMI document is able to represent interfaces for many languages such 
as COBOL, PL/I, Assembler, C, C++, and others. The production of the XMI document 275 
allows the connector tool 270 to consume the XMI instead of consuming the actual program 
source files 210, 215, and 240. This relieves the connector tool 270 from the burden of parsing 
multiple different languages and various dialects of those languages. 
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1 preferred in carrying out the preferred embodiment of the present invention. In the flowcharts, 

2 the graphical conventions of a diamond for a test or decision and a rectangle for a process or 

3 function are used. These conventions are well understood by those skilled in the art, and the 

4 flowcharts are sufficient to enable one of ordinary skill to write code in any suitable computer 

5 programming language. 
6 

7 Creating A Connector Builder Project 

8 After the start 305 of the process 300, the user accesses the repository 230, preferably 

9 via a URL (uniform resource locator), which causes process block 310 to present a list of 

10 previously analyzed applications. The items in the presented list are HTML anchors; clicking 

1 1 on any of the HTML anchors selects the application program for which a connector is to be 
L2 created. The user may also run various queries such as transactions within an application, 

US transactions by site, transactions by region, or transactions accessing a particular data store (file 

SI 

yj or database). Filters may also be added to the queries such as a filter to show screen-based 

j|j transactions only or a filter to show LINK transactions only. If the user selects one of these 

tj$ queries, then a list of transactions is presented according to applications, sites, regions, or data 

ess 

L7 stores as appropriate to the query. The user may then select one or more transactions from the 

M list. 

iU 

£9 

Mr 

2€| Process block 315 allows the user to select an application, site, region, or data store 220 

J? for which a connector 280 is to be built. Decision block 320 determines if the selected 

22 application 220 has been previously analyzed. If not, then process block 325 locates the 

23 selected application 220, and process block 330 performs application analysis to parse the 

24 source code 210 of the application program 220 to identify meta information 225. Thereafter, 

25 process block 335 stores in the repository 230 the meta information 225 and a link 235 pointing 

26 to an original location 210 of the meta information within the application program 220. 
27 

28 Identifying Transactions To Be Analyzed 

29 Process block 340 then allows the user to query the repository 230 to determine which 
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source files (210, 215, and 240) and which interfaces 255 comprise the application program 
220. Responsive to the query, process block 345 displays a list 510 of the transactions 
comprising the application program as shown in Figure 5. For each transaction contained in 
the application program 220, a visual indicia 520 is displayed such as an HTML anchor which 
navigates 252 via a link 248 to an entry point of the source code 255 corresponding to the 
transaction to aid the user in determining whether the transaction should be included in the 
connector being developed. As the transaction name may not be sufficient to enable this 
determination, the user may evaluate meta information describing the transaction 254, the 
input/output specification 610 for the transaction, or other information in the source code 
implementation (650 and 660) of the transaction to make this determination as shown in 
Figure 6. Process block 350 may display a visual indicia 530 which navigates to 
documentation stored in the repository 230 corresponding to the transaction. 

For example, the results of a query for CICS ECI transactions may be displayed in the 
form of a list of transactions as shown in Figure 6. For each transaction, the Connector 
Builder Assistant displays: 

o The transaction name (630 and 640) and other information. 

o A "Build Connector" check box or button 540, initially unselected, indicating whether 

or not the user wants the associated transaction included in the connector, 
o A "Show Source" link (520 and 660) leading to a view of the source code of the 
primary entry point of the main program which is started by the transaction which may be 
viewed via an editor or browser. 

o A "Configure" link (670 and 680) leading to the CICS ECI Transaction Configuration 
page, described below. 

o A "Configured" check box, initially unchecked, and automatically turned on if the user 
configures by following the Configure link. The user may also accept a default 
configuration by manually checking this box. The default configuration assumes that a 
DFHCOMMAREA data structure is used as both input and output. It also assumes that 
get and set methods should be built for all elementary data items in a COMMAREA. 
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o A "Name" field in which the user may enter a name for the CBA session. If the CBA 
session was named on a previous page> the name is carried forward onto this page. 

o A "Done" button (550 and 770) causes the CICS ECI Transaction Builder Assistant 
processing, documented below. 

For each transaction, an indication is also displayed as to whether this transaction is a 
LINK transaction, a BMS transaction, or an undetermined type of transaction. A check button 
540 is displayed adjacent each transaction which the user may click to select the transaction for 
inclusion in the connector. For a link transaction, under the transaction name and description 
are displayed one or more HTML anchors 760 representing links 248 stored in the repository 
230 which navigate 252 to the interface 255 associated with the transaction. For a BMS 
transaction, under the transaction name and description are displayed one or more HTML 
anchors which navigate to the BMS map descriptions for the transaction. No I/O HTML 
anchors are displayed for an undetermined transaction. The process then continues to process 
block 415 on Figure 4. This path is illustrated by flowchart connector 355 on Figure 3 and 
flowchart connector 410 on Figure 4. Process block 415 displays other HTML anchors as 
appropriate. 

Identifying Data Structures 
Process block 420 allows the user to select a displayed transaction for inclusion in the 
connector and configuration 290. For each selected transaction 630 and 640, process block 425 
may display a data structure 710, such as the CICS example shown in Figure 7, corresponding 
to the selected transaction 720 to enable the user to choose the selected transaction's input and 
output data structures. Although the invention is presented in the context of this CICS 
example, the invention may be practiced to select transactions and data structures of other 
transaction systems. Figure 7 illustrates a CICS ECI Transaction Configuration page which 
lists all 01 level COBOL data structures which are candidates for an input data structure 780, 
and, as a separate list, all variables which are candidates for an output data structure. The two 
data structure lists are ordered by I/O likelihood. I/O likelihood is computed as follows from 
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highest to lowest: 



o 



DFHCOMMAREA 790 is most likely for input or output. 



o Any 01 structures 795 which REDEFINE or RENAME DFHCOMMAREA and are 

referenced (for input) or set (for output) in the program, 
o Any 01 structures which are MOVEd into from (input) DFHCOMMAREA or which are 

MOVEd to (output) DFHCOMMAREA. 



Within the above categories, the variables are listed alphabetically. The user may select by the 
use of check boxes one or more variables for input, and one or more variables for output. A 
"Configure Fields" link 785 for each data structure leads to a CICS ECI Transaction Field 
Configuration Page which allows the user to select, using a check box, whether a given field is 
to be included or not (for input/output as appropriate). A "Done" button 770 records the user 
configuration entered on this page. 

Process block 425 allows the user to select a data structure 730, and for the selected 
data structure 730, process block 430 computes input and output fields and overlays the fields 
740 on the selected data structure 730. The input and output fields 740 are used by a connector 
builder tool 270 to determine what get and set methods to create on the connector 280. IO 
Interface Analysis computes the IO fields and IO mterface of a given program using the data in 
the repository about variables set and referenced within a callable program and all the programs 
it calls. The IO Interface analysis propagates through CALL, LINK, and XCTL statements 
within the analyzed program. This process may be recursive as the called program may in turn 
call another. Process block 435 allows the user to edit 750 the computed fields 740 if the user 
disagrees with the computed results provided by the IO Interface Analysis; and process block 
440 analyzes the selections and editions to determine if an error exists in the selection. For 
example, if the user selects an undetermined transaction type, then a warning may be given that 
no information will be exported to the connector other than the transaction name. If the user 
selects a transaction whose type is determined, but whose information is incomplete, a warning 



o 



All remaining 01 structures. 



o 



A non-01 level data structures. 
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is given. If the user has selected no transactions, an error is given. 

Creating Analysis And Report Files 
Thereafter, process block 445 constructs a new source file 265 containing the interfaces 
which comprise the application program 220; and stores the new source file 265 and a link 255 
pointing to a location of the new source file 265 in the repository 230. In the preferred 
embodiment, this file is a copybook-like file containing all 01 level data structures which are an 
input or output, as well as input and output information, in a human readable form 295 and/or 
an XML file 275. The user may view the HTML rendered report 295 which lists the 
application name; connectivity information such as that needed to configure TCP62 and the 
ECI client; each transaction of the application by name; and for a LINK transaction, the input 
and output COMMAREAs together with any data element descriptions that may be in the 
analysis repository; or for a BMS transaction, the input and output BMS maps together with 
any screen and data element descriptions that may be in the analysis repository. Additionally, 
if information about the data item usage (input only, output only, or input/output) in the 
transaction is known, it is listed in this report. Process block 450 then constructs a meta 
language document 275 containing a description of the application program interfaces to enable 
a connector building tool 270 to build a connector interface 280 to the application 220. The 
process then ends at process block 455. 

Returning now to decision block 320, if the selected application is found, then 
processing continues to process block 340 which allows the user to query the repository to 
determine which source files and which interfaces comprise the application program. 

Using the foregoing specification, the invention may be implemented using standard 
programming and/or engineering techniques using computer programming software, firmware, 
hardware or any combination or sub-combination thereof. Any such resulting program(s), 
having computer readable program code means, may be embodied within one or more 
computer usable media such as fixed (hard) drives, disk, diskettes, optical disks, magnetic tape, 
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semiconductor memories such as Read-Only Memory (ROM), Programmable Read-Only 
Memory (PROM), etc., or any memory or transmitting device, thereby making a computer 
program product, i.e., an article of manufacture, according to the invention. The article of 
manufacture containing the computer programming code may be made and/or used by 
executing the code directly or indirectly from one medium, by copying the code from one 
medium to another medium, or by transmitting the code over a network. An apparatus for 
making, using, or selling the invention may be one or more processing systems including, but 
not limited to, central processing unit (CPU), memory, storage devices, communication links, 
communication devices, servers, input/output (I/O) devices, or any sub-components or 
individual parts of one or more processing systems, including software, firmware, hardware or 
any combination or sub-combination thereof, which embody the invention as set forth in the 
claims. 

User input may be received from the keyboard, mouse, pen, voice, touch screen, or any 
other means by which a human can input data to a computer, including through other programs 
such as application programs. 

One skilled in the art of computer science will easily be able to combine the software 
created as described with appropriate general purpose or special purpose computer hardware to 
create a computer system and/or computer sub-components embodying the invention and to 
create a computer system and/or computer sub-components for carrying out the method of the 
invention. Although the present invention has been particularly shown and described with 
reference to a preferred embodiment, it should be apparent that modifications and adaptations 
to that embodiment may occur to one skilled in the art without departing from the spirit or 
scope of the present invention as set forth in the following claims. 
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